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DATE: 03/05/2002 
TIME: 13:59:07 



Input Set : A:\Salk2270.app 

Output Set: N:\CRF3\03052002\I840008.raw 
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<110> APPLICANT: EVANS, RONALD M. 

<12 0> TITLE OF INVENTION: XENOBIOTIC COMPOUND MODULATED EXPRESSION SYSTEMS AND 
USES THEREFOR 
FILE REFERENCE: SALK22 70-4 
CURRENT APPLICATION NUMBER: 09/840,008 
CURRENT FILING DATE: 2001-04-20 
PRIOR APPLICATION NUMBER: 09/458,366 
PRIOR FILING DATE: 1999-12-09 
PRIOR APPLICATION NUMBER: 09/005,286 
PRIOR FILING DATE: 1998-01-09 
NUMBER OF SEQ ID NOS : 4 3 

1 



<130> 
<140> 
<141> 
<150> 
<151> 
<150> 
<151> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<220> 
<221> 
<222> 
<220> 
<221> 
<222> 
<223> 
<400> 



SOFTWARE: Patentln Ver . 
SEQ ID NO: 1 
LENGTH: 2 06 8 
TYPE : DNA 

Homo sapiens 



ORGANISM: 
FEATURE : 
NAME/KEY : 
LOCATION: 
FEATURE : 
NAME/KEY : 
LOCATION: 
OTHER INFORMATION 
SEQUENCE : 1 



CDS 

(583) . . (1884) 

modif ied_base 
(1263) 

a, c, 



t, or g 



ggcacgagga gatctaggtt caaattaatg ttgcccctag tggtaaagga cagagaccct 
cagactgatg aaatgcgctc agaattactt agacaaagcg gatatttgcc actctcttcc 
ccttttcctg tgtttttgta gtgaagagac ctgaaagaaa aaagtaggga gaacataatg 
agaacaaata cggtaatctc ttcatttgct agttcaagtg ctggacttgg gacttaggag 
gggcaatgga gccgcttagt gcctacatct gacttggact gaaatatagg tgagagacaa 
gattgtctca tatccgggga aatcataacc tatgactagg acgggaagag gaagcactgc 
ctttacttca gtgggaatct cggcctcagc ctgcaagcca agtgttcaca gtgagaaaag 
caagagaata agctaatact cctgtcctga acaaggcagc ggctccttgg taaagctact 
ccttgatcga tcctttgcac cggattgttc aaagtggacc 
aagaacttac caccaagcag tccaagaggc ccagaagcaa 



ccc aaa gaa 

Pro Lys Glu 
5 

gag tct gtt 

65 Glu Ser Val 
66 

68 ggt ccc caa 



age tgg aac cat get gac ttt gta 
Ser Trp Asn His Ala Asp Phe Val 
10 15 
cct gga aag ccc agt gtc aac gca 
Pro Gly Lys Pro Ser Val Asn Ala Asp Glu Glu Val Gly 

25 30 35 

ate tgc cgt gta tgt ggg gac aag gec act ggc tat cac 



ccaggggaga agteggagea 
ac ctg gag gtg aga 
Leu Glu Val Arg 
1 

cac tgt gag gac aca 
His Cys Glu Asp Thr 
20 

gat gag gaa gtc gga 



60 

120 

180 

240 

300 

360 

420 

480 

54 0 

594 



642 



690 



738 
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PATENT APPLICATION: US/09/840 f 008 TIME: 13:59:07 
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6 9 Gly Pro Gin lie Cys Arg Val Cys Gly Asp Lys Ala Thr Gly Tyr His 
70 40 45 50 

72 ttc aat gtc atg aca tgt gaa gga tgc aag ggc ttt ttc agg agg gcc 786 

73 Phe Asn Val Met Thr Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ala 

74 55 60 65 

76 atg aaa cgc aac gcc egg ctg agg tgc ccc ttc egg aag ggc gcc tgc 834 

77 Met Lys Arg Asn Ala Arg Leu Arg Cys Pro Phe Arg Lys Gly Ala Cys 

78 70 75 80 

80 gag ate acc egg aag ace egg cga cag tgc cag gcc tgc cgc ctg cgc 882 

81 Glu lie Thr Arg Lys Thr Arg Arg Gin Cys Gin Ala Cys Arg Leu Arg 

82 85 90 95 100 

84 aag tgc ctg gag age ggc atg aag aag gag atg ate atg tec gac gag 930 

85 Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met lie Met Ser Asp Glu 

86 105 110 115 

88 gcc gtg gag gag agg egg gcc ttg ate aag egg aag aaa agt gaa egg 978 
.89 Ala Val Glu Glu Arg Arg Ala Leu lie Lys Arg Lys Lys Ser Glu Arg 
90 120 125 130 

92 aca ggg act cag cca ctg gga gtg cag ggg ctg aca gag gag cag egg 1026 

93 Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr Glu Glu Gin Arg 

94 135 140 145 

96 atg atg ate agg gag ctg atg gac get cag atg aaa acc ttt gac act 1074 

97 Met Met lie Arg Glu Leu Met Asp Ala Gin Met Lys Thr Phe Asp Thr 

98 150 155 160 

1122 



1170 





100 


acc 


ttc 


tec 


cat 


ttc 


aag 


aat 


ttc 


egg 


ctg 


cca 


ggg 


gtg 


ctt 


age 


agt 




101 


Thr 


Phe 


Ser 


His 


Phe 


Lys 


Asn 


Phe 


Arg 


Leu 


Pro 


Gly 


Val 


Leu 


Ser 


Ser 




102 


165 










170 










175 










180 




104 


ggc 


tgc 


gag 


ttg 


cca 


gag 


cct 


ctg 


cag 


gcc 


cca 


teg 


agg 


gaa 


gaa 


get 




105 


Gly 


Cys 


Glu 


Leu 


Pro 


Glu 


Pro 


Leu 


Gin 


Ala 


Pro 


Ser 


Arg 


Glu 


Glu 


Ala 




106 










185 










190 










195 






108 


gcc 


aag 


tgg 


age 


cag 


gtc 


egg 


aaa 


gat 


ctg 


tgc 


tct 


ttg 


aag 


gtc 


tct 




109 


Ala 


Lys 


Trp 


Ser 


Gin 


Val 


Arg 


Lys 


Asp 


Leu 


Cys 


Ser 


Leu 


Lys 


Val 


Ser 




110 








200 










205 










210 






w--> 


112 


ctg 


caa 


get 


gcg 


ggg 


gga 


gga 


tgg 


cag 


tgt 


ctg 


gaa 


eta 


caa 


acn 


ccc 


w--> 


113 


Leu 


Gin 


Ala 


Ala 


Gly 


Gly 


Gly 


Trp 


Gin 


Cys 


Leu 


Glu 


Leu 


Gin 


Xaa 


Pro 




114 






215 










220 










225 










116 


age 


cga 


cag 


tgg 


egg 


aaa 


gag 


ate 


ttc 


tec 


ctg 


ctg 


ccc 


cac 


atg 


get 




117 


Ser 


Arg 


Gin 


Trp 


Arg 


Lys 


Glu 


He 


Phe 


Ser 


Leu 


Leu 


Pro 


His 


Met 


Ala 




118 




230 










235 










240 












120 


gac 


atg 


tea 


acc 


tac 


atg 


ttc 


aaa 


ggc 


ate 


ate 


age 


ttt 


gcc 


aaa 


gtc 




121 


Asp 


Met 


Ser 


Thr 


Tyr 


Met 


Phe 


Lys 


Gly 


He 


He 


Ser 


Phe 


Ala 


Lys 


Val 




122 


245 










250 










255 










260 




124 


ate 


tec 


tac 


ttc 


agg 


gac 


ttg 


ccc 


ate 


gag 


gac 


cag 


ate 


tec 


ctg 


ctg 




125 


lie 


Ser 


Tyr 


Phe 


Arg 


Asp 


Leu 


Pro 


He 


Glu 


Asp 


Gin 


He 


Ser 


Leu 


Leu 




126 










265 










270 










275 






128 


aag 


ggg 


gcc 


get 


ttc 


gag 


ctg 


tgt 


caa 


ctg 


aga 


ttc 


aac 


aca 


gtg 


ttc 




129 


Lys 


Gly 


Ala 


Ala 


Phe 


Glu 


Leu 


Cys 


Gin 


Leu 


Arg 


Phe 


Asn 


Thr 


Val 


Phe 




130 








280 










285 










290 








132 


aac 


gcg 


gag 


act 


gga 


acc 


tgg 


gag 


tgt 


ggc 


egg 


ctg 


tec 


tac 


tgc 


ttg 




133 


Asn 


Ala 


Glu 


Thr 


Gly 


Thr 


Trp 


Glu 


Cys 


Gly 


Arg 


Leu 


Ser 


Tyr 


Cys 


Leu 



1218 



1266 



1314 



1362 



1410 



1458 



1506 
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1 "2 A 

± J4 
































lob 


gaa gac act 


gca 


ggt 


ggc 


4- +■ 


cag 


caa 


ctt 




ctg 


gag 


ccc 


ang 


ctg 


i DD4 


1 77 


f 1 1 1 Aon T 1 Vi Y~ 
blU rA 0 1 111 




\J JL y 






CZ 1 n 


Vj JL 11 


Leu 


Leu 


Leu 


CZ 1 n 


XT 1 \J 


i v i e i_ 


Leu 




1 jO 


J JL \J 








Oi. 3 










Ton 














a a. a. ttc cac 


+■ a 


atg 


ctg 


aag 


aag 


ctg 


cag 


ctg 


ca u 


gag 


gag 


gag 




1 c no 


1 A 1 

JL H ± 


Ljrb Jr lit: nib 


T~It 7 -»~ 

lyr 


1*1 c L- 


Leu 


Lys 


Lys 


Leu 


PI n 

Vj 111 


Leu 


n 1 0 




(Z 1 11 




xyr 






7 O Ci 
















*3 c. 










O ft V 




1 A A 
± 4 ft 


y cy ctg atg 


cag 


gec 


alC 


tec 


etc 


cue 


tec 


eea 


ga c 


cgc 


cca 


ggt 


gtg 


1 & 


J. ft D 


V ct JL J_itS U. ITie C 


(Z 1 n 

Hi 


A I a 
rt.1 Cl 


11c 


Ca v 

O " _L 


Leu. 


IT 11CJ 


DC1 


rr X KJ 


Asp 


Arg 


ri 




V Cl 1 










7A ^ 
jfi D 










■a cri 
JDU 










R 






± ft O 


uLy Lay cctc 


eg c 


gtg 


gtg 


ga c 


cag 


ctg 


cag 


gag 


caa 




gec 


a L L 


cl U 




14 9 


TiPii f^l n Hi c« 

UC LA nJT J_ 1 1 11 _L O 


Arg 


Val 


Val 


Asp 


Gin 


Leu 


Gin 


Glu 


Gin 


Phe 


Ala 


He 


Thr 




1 jU 




7 C A 










ODD 










7 7 n 








1 R7 


4- r-r ~\ -~\ r*r 4"- « 

cuy day tec 


UaC 




gaa 


tgc 


aaL 


egg 


ccc 


cag 


CCL 


get 




agg 


4- 4- y-, 

L. CO 


1 7AC 

JL A ft D 


1 R7 


Leu Lys Ser 


Tyr 


He 


Glu 


Cys 


Asn 


Arg 


Pro 


Gin 


Pro 


Ala 


His 


Arg 


Phe 




1 RA 
J. J *r 


375 










380 










385 










1 JU 


ttg ttc ctg 


aag 


ate 


atg 


get 


atg 


etc 


ace 


gag 


etc 


cgc 


age 


ate 


aat 


1 794 


1 c.7 

ID / 


Leu Phe Leu 


Lys 


He 


Met 


Ala 


Met 


Leu 


Thr 


Glu 


Leu 


Arg 


Ser 


He 


Asn 




1 DO 


390 








395 










400 














get cag cac 


ace 


cag 


egg 


ctg 


ctg 


cgc 


ate 


cag 


gac 


ata 


cac 


ccc 


ttt 


1 ft A 0 
JL O ft Z 


i c 1 


Ala Gin His 


Thr 


Gin 


Arg 


Leu 


Leu 


Arg 


He 


Gin 


Asp 


He 


His 


Pro 


Phe 




ICO 


405 






410 










415 










420 




1 C A 


get acg ccc 


etc 


atg 


cag 


gag 


ttg 


ttc 


ggc 


ate 


aca 


ggt 


age 






1 ft ft A 
10O4 


ICC 

JL O 3 


Ala Thr Pro 


Leu 


Met 


Gin 


Glu 


Leu 


Phe 


Gly 


He 


Thr 


Gly 


Ser 








ICC 






425 










430 
















ICQ 

loo 


tgagcggctg ccttgggtga caccttcgag aggcagecag 


acccagagcc < 


stetgagecg 


1 Q A A 
JL :? 4 4 


1 7 n 


gcactcccgg gecaagacag atggacactg ccaagagccg 


acaatgccct gctggcctgt 


70 DA 


17 7 


ctccctaggg aattcctget atgacagctg gctagcattc 


ctcaggaagg acatggggtg 


7 n C A 

Z U D ft 


1 7 A 
1/4 


cccc 




























7 0 C ft 


17 7 


<210> SEQ ID NO: 


: 2 


























17ft 
I/O 


<211> LENGTH: 434 


























1 7Q 


<212> TYPE: 


PRT 




























1 pn 


<213> ORGANISM: 


Homo sapiens 




















1 


<220> FEATURE: 




























1 QQ 

IOj 


<221> NAME/KEY: 


MOD_ 


_RES 
























1 P4 
_L O ft 


<222> LOCATION: 


(227) 
























1 oc 
IOJ 


<22 3> OTHER 


INFORMATION: 


: Threonine 


















1 R7 


<400> SEQUENCE: 


2 


























188 


Leu Glu Val 


Arg 


Pro 


Lys 


Glu 


Ser 


Trp 


Asn 


His 


Ala 


Asp 


Phe 


Val 


His 




189 


1 




5 










10 










15 






191 


Cys Glu Asp 


Thr 


Glu 


Ser 


Val 


Pro 


Gly 


Lys 


Pro 


Ser 


Val 


Asn 


Ala 


Asp 




192 




20 










25 










30 








194 


Glu Glu Val 


Gly 


Gly 


Pro 


Gin 


He 


Cys 


Arg 


Val 


Cys 


Gly 


Asp 


Lys 


Ala 




195 


35 










40 










45 










197 


Thr Gly Tyr 


His 


Phe 


Asn 


Val 


Met 


Thr 


Cys 


Glu 


Gly 


Cys 


Lys 


Gly 


Phe 




198 


50 








55 










60 












200 


Phe Arg Arg 


Ala 


Met 


Lys 


Arg 


Asn 


Ala 


Arg 


Leu 


Arg 


Cys 


Pro 


Phe 


Arg 




201 


65 






70 










75 










80 




203 


Lys Gly Ala 


Cys 


Glu 


He 


Thr 


Arg 


Lys 


Thr 


Arg 


Arg 


Gin 


Cys 


Gin 


Ala 
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DATE: 03/05/2002 
TIME: 13:59:07 
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204 



85 



90 



95 



206 Cys Arg Leu Arg Lys Cys Leu Glu Ser Gly Met Lys Lys Glu Met lie 

207 100 105 110 

2 09 Met Ser Asp Glu Ala Val Glu Glu Arg Arg Ala Leu lie Lys Arg Lys 
210 115 120 125 

212 Lys Ser Glu Arg Thr Gly Thr Gin Pro Leu Gly Val Gin Gly Leu Thr 

213 130 135 140 

215 Glu Glu Gin Arg Met Met lie Arg Glu Leu Met Asp Ala Gin Met Lys 

216 145 ' 150 155 160 

218 Thr Phe Asp Thr Thr Phe Ser His Phe Lys Asn Phe Arg Leu Pro Gly 

219 165 170 175 

221 Val Leu Ser Ser Gly Cys Glu Leu Pro Glu Pro Leu Gin Ala Pro Ser 

222 180 185 190 

224 Arg Glu Glu Ala Ala Lys Trp Ser Gin Val Arg Lys Asp Leu Cys Ser 

225 195 200 205 

227 Leu Lys Val Ser Leu Gin Ala Ala Gly Gly Gly Trp Gin Cys Leu Glu 

228 210 215 220 

W--> 230 Leu Gin Xaa Pro Ser Arg Glh Trp Arg Lys Glu lie Phe Ser Leu Leu 
231 225 230 235 240 

2 33 Pro His Met Ala Asp Met Ser Thr Tyr Met Phe Lys Gly lie lie Ser 
234 245 250 255 

236 Phe Ala Lys Val lie Ser Tyr Phe Arg Asp Leu Pro lie Glu Asp Gin 

237 260 265 270 

2 39 lie Ser Leu Leu Lys Gly Ala Ala Phe Glu Leu Cys Gin Leu Arg Phe 
240 275 280 285 

242 Asn Thr Val Phe Asn Ala Glu Thr Gly Thr Trp Glu Cys Gly Arg Leu 

243 290 295 300 

24 5 Ser Tyr Cys Leu Glu Asp Thr Ala Gly Gly Phe Gin Gin Leu Leu Leu 
246 305 310 315 320 

24 8 Glu Pro Met Leu Lys Phe His Tyr Met Leu Lys Lys Leu Gin Leu His 
249 325 330 335 

251 Glu Glu Glu Tyr Val Leu Met Gin Ala lie Ser Leu Phe Ser Pro Asp 

252 340 345 350 

254 Arg Pro Gly Val Leu Gin His Arg Val Val Asp Gin Leu Gin Glu Gin 

255 355 360 365 

2 57 Phe Ala lie Thr Leu Lys Ser Tyr lie Glu Cys Asn Arg Pro Gin Pro 
258 370 375 380 

260 Ala His Arg Phe Leu Phe Leu Lys lie Met Ala Met Leu Thr Glu Leu 

261 385 390 395 400 

263 Arg Ser lie Asn Ala Gin His Thr Gin Arg Leu Leu Arg lie Gin Asp 

264 405 410 415 

266 lie His Pro Phe Ala Thr Pro Leu Met Gin Glu Leu Phe Gly lie Thr 

267 420 425 430 
269 Gly Ser 

272 <210> SEQ ID NO : 3 

273 <211> LENGTH: 25 

274 <212> TYPE: DNA 

275 <213> ORGANISM: Artificial Sequence 
2 77 <220> FEATURE: 



file://C:\Crf3\Outhold\VsrI840008.htm 



Page 5 of 7 



RAW SEQUENCE LISTING 
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278 <223> OTHER INFORMATION: Description of Artificial Sequence: Putative SXR 



286 <210> SEQ ID NO : 4 

287 <211> LENGTH: 25 

288 <212> TYPE: DNA 

289 <213> ORGANISM: Artificial Sequence 

291 <220> FEATURE: 

292 <223> OTHER INFORMATION: Description of Artificial Sequence: Putative SXR 

2 93 response element from the steroid hydoxylase, 
294 rCYP3A2 

296 <400> SEQUENCE: 4 

297 taagcagttc ataaagttca tctac 25 

300 <210> SEQ ID NO: 5 

301 <211> LENGTH: 25 

302 <212> TYPE: DNA 

303 <213> ORGANISM: Artificial Sequence 

3 05 <220> FEATURE: 

306 <223> OTHER INFORMATION: Description of Artificial Sequence: Putative SXR 

307 response element from the steroid hydoxylase/ 
308. rUGTlA6 

310 <400> SEQUENCE : 5 

311 actgtagttc ataaagttca catgg 25 

314 <210> SEQ ID NO: 6 

315 <211> LENGTH: 26 

316 <212> TYPE: DNA 

317 <213> ORGANISM: Artificial Sequence 

319 <220> FEATURE: 

320 <223> OTHER INFORMATION: Description of Artificial Sequence: Putative SXR 

321 response element from the steroid hydoxylase, 

322 rbCYP2Cl 

324 <400> SEQUENCE: 6 

325 caatcagttc aacagggttc accaat 26 

328 <210> SEQ ID NO : 7 

329 <211> LENGTH: 33 

330 <212> TYPE: DNA 

331 <213> ORGANISM: Artificial Sequence 
3 33 <220> FEATURE: 

334 <223> OTHER INFORMATION: Description of Artificial Sequence: Putative SXR 

335 response element from the steroid hydoxylase, 

336 rP450R 

338 <400> SEQUENCE: 7 

339 cacaggtgag ctgaggccag cagcaggtcg aaa 33 

342 <210> SEQ ID NO : 8 

343 <211> LENGTH: 27 

344 <212> TYPE: DNA 

345 <213> ORGANISM: Artificial Sequence 



279 
280 
282 
283 



response element from the steroid hydoxylase, 

rCYP3Al 
<400> SEQUENCE: 3 
tagacagttc atgaagttca tctac 



25 




using n or Xaa. 
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VERIFICATION SUMMARY DATE: 03/05/2002 

PATENT APPLICATION: US/09/840, 008 TIME: 13:59:08 
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L 


112 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


1 


L 


113 


M 


341 


W 


(46) 


n n n 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


1 


L 


230 


M 


341 


W 


(46) 




or 


"Xaa" 


used, 


for 


SEQ 


ID# 


2 


L 


545 


M 


341 


w 


(46) 


"n n 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


22 


L 


564 


M 


341 


w 


(46) 


"n" 


or 


n Xaa" 


used, 


for 


SEQ 


ID# 


23 
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